Estimating and Optimizing Performance for Parallel Programs
نویسنده
چکیده
P 3 T is an interactive performance estimator that assists users in performance tuning of scientiic Fortran programs. It detects performance bottlenecks in the program, identiies the causes of performance problems, and guides the user in selecting eeective program transformations in order to gain performance. Four of the most critical performance aspects of parallel programs are estimated: load balance, cache locality, communication and computation overhead. P 3 T is an integrated tool of the Vienna Fortran Compilation System, which enables the estimator to aggressively exploit considerable knowledge about the compiler's analysis information and code restructuring strategies. We evaluate existing features and describe substantial enhancements in three key areas: graphical user interface, performance parameters and input programs. P 3 T 0 s graphical user interface directs the user to bottlenecks in a computation that prevent the program from performing well. In addition, it allows ltering and visualizing of performance data at various levels of detail. A new performance parameter, which models computation time is discussed. Furthermore, P 3 T has been extended to handle a subset of HPF programs. Experiments show the ability of P 3 T to successfully guide the user in performance tuning of parallel programs. This includes the selection of performance eecient data distribution strategies and prootable program transformations. The results are very encouraging, and demonstrate the feasibility of automatic performance prediction for parallel programs.
منابع مشابه
Parallel computation framework for optimizing trailer routes in bulk transportation
We consider a rich tanker trailer routing problem with stochastic transit times for chemicals and liquid bulk orders. A typical route of the tanker trailer comprises of sourcing a cleaned and prepped trailer from a pre-wash location, pickup and delivery of chemical orders, cleaning the tanker trailer at a post-wash location after order delivery and prepping for the next order. Unlike traditiona...
متن کاملSurvey of optimizing techniques for parallel programs running on computer clusters
In the current field of high performance computing, clusters technologies plays an ever increasing role. This paper tries to summarize state-of-the techniques for optimization of parallel programs designed to run on computer clusters. Optimizing parallel programs is a much harder task than optimizing sequential programs due to the increased complexity caused be communication and synchronization...
متن کاملP3T: An Automatic Performance Estimator for Parallel Programs
The area of parallelizing compilers for distributed memory multicomputers has seen considerable research activity during the last few years. Most of the current compilers do not provide any support for estimating performance impacts of code changes that they apply. In this paper, we present P 3 T, which is a static and automatic performance estimator for data parallel programs. It computes at c...
متن کاملDynamically Adaptive Parallel Programs
Dynamic program optimization is the only recourse for optimizing compilers when machine and program parameters necessary for applying an optimization technique are unknown until runtime. With the movement toward portable parallel programs, facilitated by language standards such as OpenMP, many of the optimizations developed for high-performance machines can no longer be applied prior to runtime...
متن کاملAN EFFECTIVE HYBRID METHOD FOR OPTIMIZING STEEL FRAMES WITH IMPROVED SEISMIC PERFORMANCE
The main purpose of this paper was to use a combination of Energy-based design method and whale algorithm (WOA), hereinafter referred to as E-WOA, to optimize steel moment frames and improve the seismic performance. In E-WOA, by properly estimating the seismic input energy and determining the optimal mechanism for the structure, steel frames are designed based on the energy balance method; acco...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- IEEE Computer
دوره 28 شماره
صفحات -
تاریخ انتشار 1995